Scaling Access to Heterogeneous Data Sources with Disco (draft, not for distribution; see TKDE for the final version)

Authors

  • Anthony Tomasic
  • Louiqa Raschid
  • Patrick Valduriez
Abstract

Accessing many data sources aggravates problems for users of heterogeneous distributed databases. Database administrators must deal with fragile mediators, that is, mediators with schemas and views that must be significantly changed to incorporate a new data source. When implementing translators of queries from mediators to data sources, database implementors must deal with data sources that do not support all the functionality required by mediators. Application programmers must deal with graceless failures for unavailable data sources. Queries simply return failure and no further information when data sources are unavailable for query processing. The Distributed Information Search COmponent (Disco) addresses these problems. Data modeling techniques manage the connections to data sources, and sources can be added transparently to the users and applications. The interface between mediators and data sources flexibly handles different query languages and different data source functionality. Query rewriting and optimization techniques rewrite queries so they are efficiently evaluated by sources. Query processing and evaluation semantics are developed to process queries over unavailable data sources. In this article we describe (a) the distributed mediator architecture of Disco; (b) the data model and its modeling of data source connections; (c) the interface to underlying data sources and the query rewriting process; and (d) query processing semantics. We describe several advantages of our system.
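As a rough illustration of the last problem, graceless failure when a data source is unavailable, the following Python sketch shows one way a mediator could return a partial answer instead of a bare failure. It is a toy under stated assumptions, not Disco's actual interface: the names `Source`, `PartialAnswer`, `SourceUnavailable`, and `evaluate_union` are hypothetical, and modelling the answer as the rows retrieved so far plus the names of the sources still to be queried is only one possible reading of the semantics summarized above.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Row = Dict[str, object]
Predicate = Callable[[Row], bool]


class SourceUnavailable(Exception):
    """Raised by a (hypothetical) wrapper when its data source cannot be reached."""


@dataclass
class Source:
    """A named data source; `fetch` returns the rows that satisfy a predicate."""
    name: str
    fetch: Callable[[Predicate], List[Row]]


@dataclass
class PartialAnswer:
    """Rows obtained so far plus the sources that could not be queried."""
    rows: List[Row] = field(default_factory=list)
    pending: List[str] = field(default_factory=list)

    @property
    def complete(self) -> bool:
        return not self.pending


def evaluate_union(sources: List[Source], predicate: Predicate) -> PartialAnswer:
    """Evaluate a selection over the union of all sources.

    Unavailable sources are recorded in `pending` rather than aborting
    the whole query, so the caller learns what is still missing.
    """
    answer = PartialAnswer()
    for source in sources:
        try:
            answer.rows.extend(source.fetch(predicate))
        except SourceUnavailable:
            answer.pending.append(source.name)
    return answer


def table_source(name: str, rows: List[Row]) -> Source:
    """Helper (assumption): an always-available in-memory source."""
    return Source(name, lambda predicate: [r for r in rows if predicate(r)])


def down_source(name: str) -> Source:
    """Helper (assumption): a source that is currently unreachable."""
    def fetch(predicate: Predicate) -> List[Row]:
        raise SourceUnavailable(name)
    return Source(name, fetch)


if __name__ == "__main__":
    sources = [
        table_source("r0", [{"name": "Mary", "salary": 200},
                            {"name": "Sam", "salary": 50}]),
        down_source("r1"),
    ]
    answer = evaluate_union(sources, lambda r: r["salary"] > 100)
    print(answer.rows, answer.pending, answer.complete)
```

In this sketch the application can present the rows it already has and retry only the sources listed in `pending` once they become reachable again.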





Publication date: 1998